Identification of Signatures of Positive Selection That Have Shaped the Genomic Landscape of South African Pig Populations

Simple Summary Pigs are important in agriculture as they produce animal-based protein for human consumption. The analysis of selection signatures has implications for the maintenance and utilization of genetic diversity and can reveal genes associated with phenotypic traits, either as a result of natural or of artificial selection. Pig populations are poorly characterised in South Africa. Hence, studies aimed at evaluating genetic distinctiveness and pig breed diversity will contribute to developing a rational plan for population conservation programs among other applications. Abstract South Africa boasts a diverse range of pig populations, encompassing intensively raised commercial breeds, as well as indigenous and village pigs reared under low-input production systems. The aim of this study was to investigate how natural and artificial selection have shaped the genomic landscape of South African pig populations sampled from different genetic backgrounds and production systems. For this purpose, the integrated haplotype score (iHS), as well as cross population extended haplotype homozygosity (XP-EHH) and Lewontin and Krakauer’s extension of the Fst statistic based on haplotype information (HapFLK) were utilised. Our results revealed several population-specific signatures of selection associated with the different production systems. The importance of natural selection in village populations was highlighted, as the majority of genomic regions under selection were identified in these populations. Regions under natural and artificial selection causing the distinct genetic footprints of these populations also allow for the identification of genes and pathways that may influence production and adaptation. In the context of intensively raised commercial pig breeds (Large White, Kolbroek, and Windsnyer), the identified regions included quantitative loci (QTLs) associated with economically important traits. For example, meat and carcass QTLs were prevalent in all the populations, showing the potential of village and indigenous populations’ ability to be managed and improved for such traits. Results of this study therefore increase our understanding of the intricate interplay between selection pressures, genomic adaptations, and desirable traits within South African pig populations.


Introduction
Pigs are one of the most important livestock species worldwide.In January 2020, the world population of pigs was estimated to be 677.6 million [1].They are key for livelihoods, food security, and economic growth, especially in developing countries where Animals 2024, 14, 236 2 of 18 they survive under harsh environments and provide for resource-limited households [2].Besides providing proteins for humans, pigs are also used as model animals for research on human diseases [2].
Wild hog species (also referred to as wild pigs) include the warthog (Phacochoreus Africanus), pig deer (Babyrousa Babyrussa), and the pygmy hog (Porcula Salvania), with only wild boar (Sus scrofa) having been domesticated [3,4].Changes in the phenotypic characteristics between domestic and wild pigs are highly noticeable and were driven by natural and artificial selection [5].Independent domestication events from local wild boar in Europe and Asia gave rise to European and East Asian pigs [6,7].As a result of strong artificial selection, there is considerable genetic distance between European and Asian domestic pigs [3,6,7].While commercial lines of European pigs are characterised by an extended body length and lean growth, East Asian domestic pigs have good fat deposition and high reproductive performance [8][9][10].
In the absence of a reproductive barrier between East Asian and European domestic pigs, hybridisation between East Asian and European and later American pig breeds has been successfully used to increase pig production [11][12][13].Previous studies clearly demonstrated a hybrid origin of the European Large White breed with Asian pigs [14].The hybridisation of domesticated pigs with wild boars on European farms has also been used to increase reproduction and genetic diversity in inbred commercial pig lines [15].Village and smallholder pigs that are farmed predominantly under free-range production systems allow for gene flow and introgression, since hybridisation occurs with wild pigs (e.g., warthogs, wild boars and bush pigs) [16,17].Although hybridisation between domesticated and wild pigs can increase production, these events may also have a negative impact on pig production [18].For example, it has been suggested that the outbreak of classical swine fever (CSF) is related to wild and domestic pigs mixing in free-range production systems [19,20].
There is sparse and unreliable information with regards to the history of pig populations in South Africa and other regions of Africa [21][22][23].Indigenous breeds most likely originated from domestic pigs that spread from sub-Saharan Africa to South Africa via the Nile Corridor [22].Commercial pig breeds from Europe and America were also introduced in South Africa for commercial farming in the 1600s by European settlers [21][22][23].While the commercial pig breeds are known for their high performance (e.g., litter size, high growth rate, and meat and carcass quality) [24][25][26], indigenous breeds are well adapted to harsh South African environmental conditions.For example, the indigenous Windsnyer has longer black hair and a thinner epidermis for increased heat tolerance that will shield it against extreme climatic conditions [27,28].The positive characteristics of indigenous and local populations (e.g., heat tolerance and disease resistance) are valuable and need to be characterised and conserved as they are also important to the livelihood of subsistence and small-scale farmers.
In 2013, it was estimated that South Africa had 38,500 commercial farms and 2 million smallholder farmers [29].While commercial pig farmers practise controlled breeding and intense artificial selection for key production traits, small-scale and village farming landscapes are characterised by poorly organised and indiscriminate crossbreeding [30].Commercial farmers mainly use European pig breeds, while pig farmers in rural areas mainly use indigenous breeds (e.g., Kolbroek and Windsnyer).However, rural farmers are increasingly shifting away from indigenous breeds towards the use of commercial exotic breeds [18,26].Crossbreeding between exotic and indigenous breeds have been used to improve performance and production, as well as to increase tolerance and/or resistance to disease and parasites and animals that are hardy and adapted to survive under harsh local conditions [27,[31][32][33].
Adaptation and domestication processes, as well as breed development, can lead to the emergence of signatures of selection in the genomes of pig populations [34].Signatures of selection have been identified in pig populations associated with important traits, such as adaptation to high altitudes [35], muscle growth [36], and body size [10].Genomic sequences of domestic and wild pigs have been observed to be predominantly similar, except in regions under strong selection pressure [10].Various authors have reported on selection in domestic pigs for disease resistance, tolerance, and productivity [37][38][39].Information on selection signatures is valuable and can be used in management strategies to improve production and adaptability.A large-scale analysis of the genetic diversity and structure of South African pig populations relative to global populations (e.g., pigs from South America, Europe, United States, and China) [40] points towards a population that has been shaped by complex evolutionary forces including domestication and continuous interactions between domestic and wild populations.This includes natural and artificial selection under the different production systems as a result of the need to adapt and survive the prevailing climatic conditions, low-input production systems, and diseases.
Many statistical methods are available to identify selection signatures.This includes the integrated haplotype score (iHS) that allows for the detection and characterisation of genomic regions that have experienced selection within a population [41].The iHS identifies regions where a selected allele has risen in frequency quickly due to positive selection, resulting in longer haplotypes around the selected variant.Although iHS methods apply statistical corrections to control for confounding factors such as population structure, demographic history, or genetic drift, it is still sensitive to population structure, potentially leading to false positives.The cross-population extended haplotype homozygosity (XP-EHH) method takes into account differences between two populations [42].Both tests have been shown to have high power in detecting selection signatures even in small sample sizes [43,44].Moreover, the XP-EHH statistic identifies population-specific genetic signatures, by identifying regions where specific alleles or haplotypes have undergone recent positive selection, leading to their rapid increase in frequency in one population but not in another [45].This comparison helps in differentiating between regions affected by local adaptation and those influenced by hitchhiking or a demographic history shared by multiple populations.The third approach, hapFLK, involves Lewontin and Krakauer's extension of the Fst statistic based on haplotype information [43,46].This test measures differences in haplotype frequencies between populations, while accounting for their hierarchical structure, enabling the capturing of population-specific genetic signatures, even in scenarios with limited sample sizes [44,47].As a result, the hapFLK method is a powerful tool for identifying signatures of selection, even in the presence of bottlenecks and migration, while limiting the effects of hitchhiking.
The aim of this research was to identify and characterise genomic regions that display signatures of natural and artificial selection in South African pig populations.For this purpose, commercial, village, indigenous, wild and Vietnamese potbelly pig populations that were previously genotyped were included [40].To improve the statistical power for detecting the selection signatures, we used the iHS, XP-EHH, and hapFLK approaches.Specifically, the iHS was used to identify and characterise signatures of selection in each of the populations, while XP-EHH was used to identify and characterise selection signatures between different pairs of populations.The hapFLK statistical method was used to identify and characterise selection signatures between the multiple populations (i.e., including all the pig populations).

Animal Samples, Genotyping, and Quality Control
In total, 234 animals that were previously genotyped were used in this study [40].This included 60 pigs from commercial farms represented by the Large White (LWT), South African Landrace (SAL) and Duroc (DUR) breeds, 40 indigenous pigs represented by Kolbroek (KOL) and Windsnyer (WIN) breeds, as well as 91 village and non-descript pig populations.The latter were obtained from villages in the Eastern Cape (Alfred Nzo, ALN and O.R. Tambo, ORT) and the Limpopo (Capricorn, CAP and Mopani, MOP) districts.In addition, 5 Vietnamese Potbelly pigs (VIT) from the Johannesburg Zoo and 38 wild pigs represented by the warthog (WAT), wild boar (WBO) and bush pig (BSP) were collected from various game reserves.
The animals were genotyped using PorcineSNP60 v2 BeadChip (Illumina, San Diego, CA, USA) containing 62,163 SNPs with an average gap of 43.4 kb [40].Markers with a call rate lower than 85% and not physically mapped to the S. scrofa 11.2 genome assembly were discarded using Golden Helix SNP Variation Suite (SVS) version 8.8.1.Markers with a minor allele frequency (MAF) lower than 2%, and those that deviated from the Hardy-Weinberg equilibrium (p-value < 0.0001) were also excluded.BEAGLE (version 5.1) was used to phase the autosomal genome using 30 iterations of the phasing algorithm on a 5 Mb chromosomal region and sample haplotype pairs for each individual per iteration for all the data sets used.
All of the pig populations (Table 1) were included in the analyses to detect selection signatures, including the populations consisting of fewer than 10 individuals (BSP, WBO, and VIT).Based on the population diversity and structure results [40], the indigenous pig populations (WIN and KOL) were grouped together (IND, n = 40), the village populations sampled in Limpopo (MOP and CAP) were grouped together (LIM, n = 52), and those in the Eastern Cape (ORT and ALN) were grouped together (EC, n = 39) in the XP-EHH and hapFLK analysis.

Detection of Signatures Using iHS
The iHS was used to screen for non-overlapping regions within a population under positive selection.Plink 1.9 was used to exclude duplicate SNPs and to recode all genotypes using the --allele1234 script.Plink format map and ped files were converted into the fastPHASE format using the recode fastphase script.This generated a fastphase.inpfile that was used in fastPHASE software 1.4.8.This software was then used to estimate missing genotypes and unobserved haplotypes from unphased data for each chromosome.This then created an input file, the fastphase_hapguess_switch.out file, which was used to calculate the iHS.Once phasing was completed, the iHS was calculated on individual sites for possible signatures using the rehh software in the R environment.The absolute unstandardised iHS (uniHS) was identified as the log ratio iHH A ancestral to the derived iHH D allele for each SNP [41].As the standardised iHS scores are roughly distributed normally with mean = 0 and standard deviation = 1, regions with an average iHS score of 3 (three standard deviations above the mean) or above with at least five SNPs ≤ 100 kb were considered candidate regions for selection.Manhattan plots were generated in the R package qqman 2.3.

Detection of Signatures Using XP-EHH
Selective sweeps between populations were detected using XP-EHH, which makes it possible to find selected regions using the genetic distance between adjacent SNPs based on the of EHH model [42].The cross-population EHH (XP-EHH) statistic is similar to Rsb (Robertsonian selection bias) and compares one population to the other using haplotypes [45].Sabeti et al. [42] defined XP-EHH as standardised (unXP-EHH), identified as a mean (unXP-EHH) and standard deviation (unXP-EHH) for each given SNP.The argument was set to pop1, identified as p right XP−EHH relative to pop2 as p le f t XP−EHH .to find regions associated with each population.XP-EHH used the same phased file as the iHS did and therefore the iHS was firstly calculated for each population using the rehh package [47] in R. Regions with an average XP-EHH score of 3 (three standard deviations above the mean) or above with at least five SNPs ≤ 100 kb were considered candidate regions for selection.Manhattan plots were generated in the R package qqman.

Detection of Signatures Using HapFLK
To reveal genetic differentiations in genomic regions subjected to selection from multiple populations, the HapFLK method was employed.This test accounts for the haplotype structure of the population whilst using polymorphic SNPs in ancestral populations.Reynolds distances were calculated using HapFLK 1.3.0software (https://forge-dga.jouy.inra.fr/projects/hapflk) and then converted into a kinship matrix with the HapFLK package in RStudio.A FastPHASE cross-validation procedure was used to determine haplotype diversity [46].In total, 20 clusters with 30 maximisation iterations on a per chromosome basis were used to calculate the HapFLK statistic.A standard normal distribution was calculated at each SNP using p-values.Selected regions were identified using a p-value ≤ 0.001 [48].For this study, the indigenous breed Kolbroek was used as an outgroup.

Annotation and Function Analyses of Identified Genomic Regions
The candidate regions identified using the three different methods (i.e., the iHS, XP-EHH, and HapFLK) were annotated for genes, quantitative trait loci (QTLs) and functional pathways.For this purpose, BioMart on the Ensembl gene database website was used to annotate genes at particular genome coordinates for all selected regions (release 89).A candidate region search and identification was performed within 1 Mb to the left and right of statistically significant SNPs.The current pig genome S. scrofa 11.2 assembly was used to extract gene symbols.Web-based Panther was employed for functional and pathway enrichment analysis.A false discovery rate (FDR) < 0.10 was used to assess the significance of enriched pathways.The pig QTL (Release 46) database was used to align candidate genes to an available QTL.

Detection of Signatures within a Population Using iHS
After quality check, 27,422 SNPs in total were retained for further analysis.The iHS method was used to detect a positive selection within a population, and identified potential genomic regions in all 13 populations included in this study (Figure 1; Table S1: Supplementary Material File S1).
The number of regions identified differed greatly between the different populations ranging from 87 for CAP (Figure 1C) to 4 for BSP (Figure 1L) (Table S2: Supplementary Material File S2).The low numbers of regions identified in the BSP and VIT populations (Figure 1J,L) likely reflect their small sample sizes.Most selection regions were identified among the village pigs, which included the CAP (87 regions), ALN (40 regions), ORT (71 regions), and MOP (68 regions) populations (Figure 1A-D).Fewer selection regions were identified within the indigenous pig population (KOL 17; WIN 22; Figure 1H,I).In contrast to the wild boar population (WBO, 32 regions) (Figure 1M), fewer regions were identified for the warthog (WAT, 10 regions) and bush pig (BSP, 4 regions) populations (Figure 1L,M).While a high number of regions were also identified for the LWT (34 regions) and SAL (31 regions) commercial pig populations, 12 regions were identified for the DUR population (Figure 1G-I).The number of regions identified differed greatly between the different populations ranging from 87 for CAP (Figure 1C) to 4 for BSP (Figure 1L) (Table S2: Supplementary Material File S2).The low numbers of regions identified in the BSP and VIT populations (Figure 1L,J) likely reflect their small sample sizes.Most selection regions were identified among the village pigs, which included the CAP (87 regions), ALN (40 regions), ORT (71 regions), and MOP (68 regions) populations (Figure 1A-D).Fewer selection regions were identified within the indigenous pig population (KOL 17; WIN 22; Figure 1H-I).In contrast to the wild boar population (WBO, 32 regions) (Figure 1M), fewer regions were identified for the warthog (WAT, 10 regions) and bush pig (BSP, 4 regions) populations (Figure 1M-L).While a high number of regions were also identified for the LWT (34 regions) and SAL (31 regions) commercial pig populations, 12 regions were identified for the DUR population (Figure 1G-I).
The regions displaying significant selection were distributed on different chromosomes, harbouring genes associated with different traits (Table S2: Supplementary Material File S2).For example, the region on chromosome 13 (145.36Mbp) displaying the strongest selection signal (iHS score of 24.09) in the WBO population overlapped with a GHRL gene related to weight gain, as well as several QTLs associated with the feed conversion ratio, age at slaughter, average backfat thickness, and average daily gain, amongst others.(Figure 1M; Table 2).Other regions identified using included QTLs associated with intramuscular fat content that included the regions on chromosome 1 (ORT and CAP), chromosome 2 (LWT), chromosome 4 (KOL), chromosome 5 (WBO), chromosome 8 (WIN), chromosome 12 (ORT), and chromosome 14 (CAP), as well as QTLs associated with the number of teats that included the regions on chromosome 7 (ALN, DUR), chromosome 8 (ORT), chromosome 16 (MOP), chromosome 14 (CAP), chromosome 2 (LWT, WBO), and chromosome 14 (LWT) (Table 2).The regions displaying significant selection were distributed on different chromosomes, harbouring genes associated with different traits (Table S2: Supplementary Material File S2).For example, the region on chromosome 13 (145.36Mbp) displaying the strongest selection signal (iHS score of 24.09) in the WBO population overlapped with a GHRL gene related to weight gain, as well as several QTLs associated with the feed conversion ratio, age at slaughter, average backfat thickness, and average daily gain, amongst others.(Figure 1M; Table 2).Other regions identified using included QTLs associated with intramuscular fat content that included the regions on chromosome 1 (ORT and CAP), chromosome 2 (LWT), chromosome 4 (KOL), chromosome 5 (WBO), chromosome 8 (WIN), chromosome 12 (ORT), and chromosome 14 (CAP), as well as QTLs associated with the number of teats that included the regions on chromosome 7 (ALN, DUR), chromosome 8 (ORT), chromosome 16 (MOP), chromosome 14 (CAP), chromosome 2 (LWT, WBO), and chromosome 14 (LWT) (Table 2).

Detection of Selection of Signatures between Populations Using XP-EHH
Several regions that displayed significant evidence of selection were detected between pairs of populations using XP-EHH (Figure 2; Table S3: Supplementary Material File S3).The numbers of regions displaying significant evidence of selection differed greatly between the paired populations.A high number of regions were identified between the commercial population (DUR) paired with the village (EC, 38 regions and LIM, 19 regions), indigenous (IND, 13 regions), and commercial (LWT, 23 regions) populations.Although a high number of regions were identified between the warthog population (WAT) paired with the village (LIM, 34 regions), commercial (LWT, 14 regions), and indigenous (IND, 10 regions) populations, fewer regions were identified between the WAT paired with the wild boar (WBO, 5 regions) population.This was also true for Vietnamese potbelly pigs (VIMs), with a high number of regions identified between VIM paired with the village (LIM, Although a high number of regions were identified between the warthog population (WAT) paired with the village (LIM, 34 regions), commercial (LWT, 14 regions), and indigenous (IND, 10 regions) populations, fewer regions were identified between the WAT paired with the wild boar (WBO, 5 regions) population.This was also true for Vietnamese potbelly pigs (VIMs), with a high number of regions identified between VIM paired with the village (LIM, 14 regions and EC, 10 regions) and indigenous (IND, 11 regions) populations, while only one region was identified between VIM and the commercial (LWT, 1 regions) populations and VIMs paired with the wild boar (WBO, 5 regions) population.Several QTLs and genes occurred in the genomic regions identified using the XP-EHH method (Table 3, Table S4: Supplementary Material File S4).The strongest signal (XP-EHH score of 6.91) was observed for VIT_LIM on chromosome 9 (Figure 2N).Even though this region was not associated with known QTLs, several regions identified with the XP-EHH method were linked with QTLs associated with important traits.For example, the regions on chromosome 1 (166.17

Detection of Selection of Signatures between Populations Using HapFLK
Across all populations, regions displaying significant (p-value ≤ 0.001) evidence of selection were identified on chromosomes 5 and 6 using KOL as an outgroup (Figure 3).

Detection of Selection of Signatures between Populations Using HapFLK
Across all populations, regions displaying significant (p-value ≤ 0.001) evidence of selection were identified on chromosomes 5 and 6 using KOL as an outgroup (Figure 3).In total, 5924 segments displaying significant (p-value < 0.10) evidence of selection were associated with 1179 genes (Table S5: Supplementary Material File S5).The regions on chromosomes 5 and 6 were linked with QTLs associated with intramuscular fat content, litter size, number of teats, as well as age at slaughter, meat to fat ratio, and body weight (Table 4).In total, 5924 segments displaying significant (p-value < 0.10) evidence of selection were associated with 1179 genes (Table S5: Supplementary Material File S5).The regions on chromosomes 5 and 6 were linked with QTLs associated with intramuscular fat content, litter size, number of teats, as well as age at slaughter, meat to fat ratio, and body weight (Table 4).

Genes Identified Using Different Signatures of Selection Methods
The iHS, XP-EHH, and HapFLK methods allowed the detection of the same genomic regions on chromosomes 5 and 6.The iHS method detected the region on chromosome 5 in the ORT, CAP, WIN, KOL, LWT, and MOP populations, while the region on chromosome 6 was detected in DUR, ALN, ORT, and CAP.The XP-EHH method detected the region on chromosome 5 between the DUR_WBO pairing, while the method detected the region on chromosome 6 between the DUR_EC, DUR_WAT, and DUR_LIM pairings.The region on chromosome 5 detected in the DUR_WBO pairing encoded NECAP1 and KCNJ3 genes.The GO terms reported for DUR_WBO included the regulation of ion transmembrane transport, the clathrin vesicle coat, voltage-gated potassium channel activity, the plasma membrane, vesicle-mediated transport, ligand-gated ion channel activity, and potassium transmembrane transport (Table S6: Supplementary Material File S6).These regions also included genes linked to important signalling pathways, namely the G-protein signalling pathway, the GABA-B_receptor_II_signalling pathway, and the muscarinic acetylcholine receptor 2 and 4 signalling pathway (Table S7: Supplementary Material File S7).
The genes located in the region on chromosome 6 included EPHB2, EPB41L3, METTL4, EPHA8, LYPLA2, FUCA1, PNRC2, SRSF10, MYOM3, SLC16A12, PANK1, and PCGF5.It also included important GO terms linked to the cellular response to follicle-stimulating hormone stimuli, the fucose metabolic process, the regulation of cell growth, growth factor binding, carboxylic ester hydrolase activity, and palmitoyl-(protein) hydrolase activity (Table S6: Supplementary Material File S6).Important pathways were found to be related to the dopamine receptor-mediated signalling pathway and Coenzyme A biosynthesis (Table S7: Supplementary Material File S7).

Discussion
To date, this is the first study identifying signatures of selection in South African pig populations from different genetic backgrounds.We included animals from commercial farms and villages, as well as indigenous and wild roaming pigs.The adaptation footprints across these genomic landscapes were evaluated using within and cross-population selection statistics.Although these methods accounted for small population samples, their statistical power was diminished by the small sample sizes used for bush pig, wild boar and Vietnamese potbelly pig populations [49].Nevertheless, a number of genomic regions containing significant evidence of population-specific selection signatures were detected in the case of of wild boar, which we explored further in this study.
The population genomic approach utilised in this study allowed for the identification of genomic regions under natural selection, such as the indigenous Kolbroek and Windsyner, as well as in the wild boar.Specifically, the region on chromosome 5 included a putative KCNJ3 gene, which may be associated with an udder structure in cattle that is typically important for production efficiencies as well as animal health and welfare [50].This region also encoded genes involved in signalling pathways such as muscarinic acetylcholine receptors that are G protein-coupled receptors (GPCRs) playing a key role in regulating many fundamental functions (e.g., motor control, temperature control, control of inflammation, cell growth, and cell proliferation, as well as control of the airways, gastrointestinal and urinary tracts, cardiovascular system, the central nervous system, and eye) [51].The region on chromosome 13 under natural selection in the wild boar population encoded a putative GHRL gene known to regulate growth and development in pigs [52,53].The identification of these regions thus provides an opportunity to elucidate the genetic basis of the adaptive evolution of local wild and indigenous pig populations in the future including larger sample sizes.
Signatures of selection identified in the commercial pig populations included regions associated with traits such as meat and carcass quality.This is expected, as the Large White, Durocs and South African Landrace pigs are bred for meat production [5,10].Because of this strong artificial selection and because the internal mechanism is the selection of genes, genes in these regions associated with meat and carcass quality included CORIN on chromosome 8, TMPRRSS4 on chromosome 9, SLC44A5 on chromosome 6, APBB2 on chromosome 8, TECTA on chromosome 9, LIPA and IDE on chromosome 14, and ITGA2 on chromosome 16.The DECR1 gene on chromosome 4 is associated with cholesterol levels amongst other meat quality and growth traits.Regions associated with meat and carcass quality were also identified among the indigenous breeds.For example, the indigenous Kolbroek and Windsnyer breeds included the JPH1 gene observed on chromosome 4, which has previously been linked to meat and carcass quality in pigs [54,55].Furthermore, Hoffman et al. [56] observed that meat from Kolbroek pigs can be processed into bacon, ham, and chops.This shows that indigenous breeds can also be identified with traits despite their slow growth rate.
Among the genomic regions displaying signatures of selection, some were associated with fatness, an important economic trait in pig farming [57].For example, ITGA11 on chromosome 1 is associated with an obesity index that determines fat deposition in pigs and other animals [40].The genomic regions (chromosomes 5 and 6) identified with all three the statistics were linked with QTLs associated with intra-muscular fat content, meat to fat ratio, and body weight.Regions that are associated with excess fat deposition when fed improved diets [36,58] present an opportunity to genetically improve meat quality in these breeds.A study by Jung et al. [59] and Ren et al. [60] observed that consumers preferred lean pork with high intramuscular fat content.As a result, commercial breeds displayed lower fat levels compared to European and Chinese breeds [61,62].While commercial breeds (e.g., Large White, Duroc, and Landrace) have low levels of fat tissue, European breeds (e.g., Iberian and Mangalica pigs) and Chinese breeds are predisposed to accumulate excess amounts of adipose tissue [62][63][64].Hoffman et al. [65] reported consumers' preference towards meat with a higher lean percentage.Wild boars have low intra-muscular fat and are categorised under game meat that has high protein and iron and that is considered healthier than ordinary pork or beef meat [66][67][68].Since pig breeds vary when it comes to fat tissue deposition with heritability levels being around 0.5 [61,62], obesity indices and intra-muscular fat can be used as potential tools for selecting animals with desirable meat and carcass qualities.For example, SCPEP1 identified in this study regulates body fat content and correlates with intra-muscular fat deposition in pigs [69].
Genomic regions displaying signatures of selection were associated with reproduction traits such as litter size and total number born alive from a sow, semen volume, sperm concentration, sperm motility, etc.For example, PIK3R5 is one of the genes identified in the O.R. Tambo population that influences litter size at birth and the number of piglets born alive [70], which is important as pigs differ greatly in litter size.For example, the wild boar sows an average if 6.6 litters [71] per year versus an average of 14 to 15.3 litters per sow in Large White breeds [72,73], while indigenous breeds such as Kolbroek average at 810 piglets [74].Nowadays, the pig industry in Europe has been yielding 18-20 litters per sow [75].This high litter number has a negative implication on the physiological tolerance for both sows and litters.The good mothering ability and hardiness of sows ensure high survival rates for the litters.Commercial breeds have an advantage being raised in the intensive production system.Several genomic regions contain QTLs associated with the number of teats on chromosomes 1, 2, 5, 6, 14, 16, and 18.The number of teats is an important trait as it ensures that piglets have adequate access to milk from the sow.The number of teats can have effects on the weaning weight of a piglet and a smaller number of teats in a sow reduces piglets' survival rate [76].In commercial breeds such as Large White and Duroc, a sow can have as many as 19 teats [77].Makhanya [78] reported the number of teats to be an average of 10 in indigenous Kolbroek pigs.Various studies have shown that the number of teats is an essential morphological and reproductive trait that has been under selection for many generations in the pig industry [77,79].
A high number of regions that display significant selection were detected in the South African village pig populations.This is similar to what was previously seen for cattle, where Van Hossou et al. [80] also reported a higher number of selection signatures in admixed West African cattle populations in Benin.The presence of more selection signatures in village pig populations compared to that in other populations can be attributed to several factors.One possible explanation is that genetic diversity may provide a broader pool of genetic variants for selection to act upon, resulting in a higher number of selection signatures.Several genes related to health and resistance to parasites were identified in the village populations, which is well in line with the sturdy nature of this breed.This included the APBB2 gene present in regions under selection in the Alfred Nzo, Mopani, and Capricorn populations, which was shown to regulate inflammatory responses during infection with porcine reproductive and respiratory syndrome virus, which is a major respiratory pathogen of pigs [71].The LIPA gene under selection in the Capricorn population may be involved in the response to wounds and inflammations, as well as in the molecular genetic mechanisms affecting fecundity in sheep [72].Village pigs are well adaptable to local harsh conditions, and this makes them important genetic resources that provide new diversity for the improvement of commercial lines.
Another explanation for the high number of regions is the admixture between village pigs and commercial pigs, which could allow for an improvement in economic traits such as reproduction, growth, and carcass traits among village pigs.Crossbreeding with commercial pigs has allowed for the introduction of genetic variants that are advantageous for these traits in not only village pigs but also indigenous pigs.For example, regions displaying selection signatures included genes for meat and carcass quality in pigs in the village (e.g., SCPEP1 and SAMD4A) and indigenous (e.g., JPH1) populations [54,55,69].The admixed genomes that result from the interbreeding of previously isolated populations can carry genetic signatures that resemble signals of positive selection.Therefore, the possibility that some of the genomic selection signatures identified here stem from historical admixture (i.e., they represent the "ghosts" of introgression) and not recent adaptive events could not be discounted [81].Although further research would be needed to distinguish these types of signatures in all of the populations examined, the genetic remnants of past genetic exchange in admixed genomes may represent a valuable source of variation for further selection and/or adaptation [81].

Conclusions
This study identified several regions displaying significant signatures of selection, which are the result of natural and artificial directional selection events that have contributed to the adaptation of breeds to different environments and production systems of these pig populations.These signatures of selection allowed for the identification of the genomic regions and evolutionary processes that have shaped the populations and affect important phenotypic traits.These included traits related to reproduction, production, health, and meat and carcass quality.Meat and carcass QTLs were prevalent in all the populations, showing the potential of village and indigenous populations' ability to be managed and improved for such traits.Our findings also confirm that genetic resources from villages and wild pigs are important for research as they are not influenced by selection when compared to commercial breeds.Additionally, as BeadChip, used in this study, may not be dense enough to fully understand the signatures between domestic and wild pigs, further research based on larger population sizes is required.
Institutional Review Board Statement: Ethical approval for sample collection for this study was attained from the Agricultural Research Council-Irene Animal Ethics Committee (APIEC16/028).Permission was granted by the Department of Agriculture, Forestry and Fisheries to conduct the investigation in accordance with Section 20 of the Animal Diseases ACT of 1984 (ACT No. 35 of 1985).This was needed as domestic and wild pig samples were sampled and therefore precaution was needed to prevent the spread of African swine fever and foot and mouth diseases (12/11/1/1).
Informed Consent Statement: Not applicable.

Figure 1 .
Figure 1.Manhattan plot of the genome-wide distribution of the selection of signatures detected via the iHS across the 18 chromosomes (indicated in different colours) for the village (A-D), indigenous (E,F), commercial (G-I), Vietnamese potbelly (J), and wild pig (K-M) populations.

Figure 1 .
Figure 1.Manhattan plot of the genome-wide distribution of the selection of signatures detected via the iHS across the 18 chromosomes (indicated in different colours) for the village (A-D), indigenous (E,F), commercial (G-I), Vietnamese potbelly (J), and wild pig (K-M) populations.

Figure 2 .
Figure 2. Manhattan plot of the genome-wide distribution of the selection of signatures between populations detected via XP-EHH across the 18 chromosomes (indicated in different colours) for DUR (A-G), WAT (H-L), and VIT (M-Q) populations.

Figure 2 .
Figure 2. Manhattan plot of the genome-wide distribution of the selection of signatures between populations detected via XP-EHH across the 18 chromosomes (indicated in different colours) for DUR (A-G), WAT (H-L), and VIT (M-Q) populations.
Mbp) and chromosome 6 (80.64 Mbp) identified for the commercial population (DUR) paired with the village populations (EC and LIM) are linked with QTLs associated with reproduction, while the region on chromosome 2 (113.82Mbp) identified in the commercial population (DUR) paired with the village population (EC) and commercial population (LWT) is linked with QTLs associated with meat and carcass quality traits.The region identified on chromosome 1 (193.82Mbp) detected in the wild boar population (WBO) paired with the Vietnamese potbelly pig (VIT) and commercial population (DUR) is linked to QTLs associated with key reproduction traits such as litter Animals 2024, 14, 236 9 of 18 size, maternal infanticide, plasma droplet rate, semen volume, sperm concentration, sperm motility, and total number born alive.

Figure 3 .
Figure 3. Manhattan plot for signature of selection of South African pig populations detected via HapFLK across 18 chromosomes (indicated in different colours).

Figure 3 .
Figure 3. Manhattan plot for signature of selection of South African pig populations detected via HapFLK across 18 chromosomes (indicated in different colours).

Table 1 .
Summary of the sampled pig populations.

Table 2 .
Within-population list of genomic regions under selection and candidate genes detected using the iHS method.

Table 3 .
Selected regions and candidate genes detected between pairs of populations using the XP-EHH method.
Average daily gain, Backfat between 3rd and 4th last rib, Birth weight variability, Body weight (end of test), Conductivity 45 min post-mortem, Fat androstenone level, Intramuscular fat content, Time in feeder per day, pH 24 h postmortem (ham), pH 45 min postmortem, Teat number

Table 3 .
Cont.Average daily gain, Backfat between 3rd and 4th last rib, Birth weight variability, Body weight (end of test), Conductivity 45 min post-mortem, Fat androstenone level, Intramuscular fat content, Time in feeder per day, pH 24 h postmortem (ham), pH 45 min postmortem, Teat number Front leg conformation, Gait score (overall), Hind leg conformation, Litter size, Maternal infanticide, Plasma droplet rate, Semen volume, Sperm concentration, Sperm motility, Total number born alive Average daily gain, Backfat between 3rd and 4th last rib, Birth weight variability, Body weight (end of test), Conductivity 45 min post-mortem, Fat androstenone level, Intramuscular fat content, Time in feeder per day, pH 24 h postmortem (ham), pH 45 min postmortem, Teat number Average daily gain, Backfat between 3rd and 4th last rib, Birth weight variability, Body weight (end of test), Conductivity 45 min post-mortem, Fat androstenone level, Intramuscular fat content, Time in feeder per day, pH 24 h postmortem (ham), pH 45 Front leg conformation, Gait score (overall), Hind leg conformation, Litter size, Maternal infanticide, Plasma droplet rate, Semen volume, Sperm concentration, Sperm motility, Total number born alive

Table 4 .
Genomic regions under selection detected via HapFLK methods in South African pigs.